Automatic Categorization of Software Modules
نویسندگان
چکیده
The world of software has demonstrated the remarkable appeal of communal software development. Large number of software projects can leverage, reuse, and coordinate their work through internet and web-based technology. For example, Source-Forge currently hosts about sixty thousand software systems, similar strategies have suggested for corporate software development. With thousands of projects, manually locating related projects can be difficult. Hence to use automatic software categorization to find clusters of related software projects using only the source code from projects, automatic categorization of software experiments with a set of programs. Automatic categorization of software systems is a novel and intriguing challenge on software archive. Evolution has focused on determining intracomponent relations of given software system also increase to differentiate between categories. Function oriented produces better result than the object oriented. Automatic categorization of software has provided better results than LSA retrieval techniques in terms of Precision and Recall with multinomial Naïve Bayes scheme has outperformed all other approaches and shows better results than the existing approach (SVD), being used by some open source code repositories e.g. Source forge Hence, the tool can also be utilized for the automatic categorization of software components and this kind of automation may improve.
منابع مشابه
Mining Software Repositories for Defect Categorization
Early detection of software defects is very important to decrease the software cost and subsequently increase the software quality. Success of software industries not only depends on gaining knowledge about software defects, but largely reflects from the manner in which information about defect is collected and used. In software industries, individuals at different levels from customers to engi...
متن کاملAn Overview of Automatic Audio Segmentation
In this report we present an overview of the approaches and techniques that are used in the task of automatic audio segmentation. Audio segmentation aims to find changing points in the audio content of an audio stream. Initially, we present the basic steps in an automatic audio segmentation procedure. Afterwards, the basic categories of segmentation algorithms, and more specific the unsupervise...
متن کاملToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database
We report on the original implementation of named entity recognition (NER) modules based on an automatic text categorization pipeline, so-called ToxiCat (Toxicogenomic Categorizer), developed to perform biomedical documents classification and prioritization for the previous Biocreative campaign in order to speed up the curation of the Comparative Toxicogenomics Database (CTD). ToxiCat NER modul...
متن کاملA K-Means Based Clustering Approach for Finding Faulty Modules in Open Source Software Systems
Prediction of fault-prone modules provides one way to support software quality engineering. Clustering is used to determine the intrinsic grouping in a set of unlabeled data. Among various clustering techniques available in literature K-Means clustering approach is most widely being used. This paper introduces K-Means based Clustering approach for software finding the fault proneness of the Obj...
متن کاملAutomatic Categorization Tool for Open Software Repositories
The world of Open Source software has demonstrated the remarkable appeal of communal software development. Large number of software projects can leverage, reuse, and coordinate their work through Internet and web-based technology. For example, SourceForge currently hosts about sixty thousand software systems. Similar strategies have been suggested for corporate software development, through not...
متن کامل